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Abstract 

In order to develop statistical methods for shapes with a tree-structure, 
we construct a shape space framework for tree-like shapes and study met- 
rics on the shape space. This shape space has singularities, corresponding 
to topological transitions in the represented trees. We study two closely 
related metrics on the shape space, TED and QED. QED is a quotient 
Euclidean distance arising naturally from the shape space formulation, 
while TED is the classical tree edit distance. Using Gromov's metric 
geometry we gain new insight into the geometries defined by TED and 
QED. We show that the new metric QED has nice geometric properties 
which facilitate statistical analysis, such as existence and local uniqueness 
of geodesies and averages. TED, on the other hand, does not share the 
geometric advantages of QED, but has nice algorithmic properties. We 
provide a theoretical framework and experimental results on synthetic 
data trees as well as airway trees from pulmonary CT scans. This way, 
we effectively illustrate that our framework has both the theoretical and 
qualitative properties necessary to build a theory of statistical tree-shape 
analysis. 

Keywords: Trees, Tree metric, Shape, Anatomical structure. Pattern match- 
ing, Pattern recognition, Geometry 



1 Introduction 



Tree-shaped objects are fundamental in nature, where they appear, e.g., as 
delivery systems for gases and fluids f^, as skeletal structures, or describing 
hierarchies. Examples encountered in image analysis and computational biology 
39 , vascular systems [5[, shock graphs [2l|31l|36 
1 20] and phylogenetic trees |3l 



are airway trees 24 




scale 



space hierarchies |6l 

Statistical methods for tree-structured data would have endless applications. 
For instance, one could make more consistent studies of changes in airway ge- 
ometry and structure related to airway disease 
computer aided diagnosis and prognosis. 



33 , 42 to improve tools for 
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(a) Edge matching 



(b) Geodesic candidate 



Figure 1: A good metric must handle edge matchings which are inconsistent 
with tree topology. 



Due to the wide range of applications, extensive work has been done in the 



past 20 years on comparison of trees and graphs in terms of matching 18||24[|38 
object recognition |6,19| and machine learning 11,15 28 based on inter-tree 
distances. However, the existing tree-distance frameworks are algorithmic rather 
than geometric. Very few attempts T| [26l[4T have been made to build analogues 
of the theory for landmark point shape spaces using manifold statistics and 
Riemannian submersions 



12,17,32 



There exists no principled approach to 
studying the space of tree-structured data, and as a consequence, the standard 
statistical properties are not well-defined. As we shall see, difficulties appear 
even in the basic problem of finding the average of two tree-shapes. This paper 
fills the gap by introducing a shape-theoretical framework for geometric trees, 
which is suitable for statistical analysis. 

Most statistical measurements are based on a concept of distance. The most 
fundamental statistic is the mean (or prototype) m for a dataset {xi, . . . ,0;^}, 
which can be defined as the minimizer of the sum of squared distances to the 
data points: 



m 



— argmin^ ^ d{x^ x^)^. 



(1) 



This definition of the mean, called Frechet mean^ assumes a space of tree-shapes 
endowed with a distance and is closely connected to geodesies, or shortest 
paths, between tree-shapes. For example, the midpoint of a geodesic from xi 
to X2 is a mean for the two-point dataset {xi^X2}. Hence, if there are multiple 
geodesies connecting xi to X2, with different midpoints, then there will also 
be multiple means. ^4^ a consequence, without (local, generic) uniqueness of 
geodesies, statistical properties are fundamentally ill-posed! 

Thus, geometry enters the picture, and the idea of a geodesic tree-space 
gives a constraint on the possible geometric structure of tree-space. In a shape 
space where distances are given by path length, we must be able to continuously 
deform any given tree-shape into any other by traveling along the shortest path 
that connects the two shapes. The deformation-paths are easy to describe when 
only branch shape is changed, while tree topology (branch connectivity) is fixed. 
Such deformations take place in portions of tree-space where all trees have the 
same topological structure. It is more challenging to describe deformation- 
paths in which the tree-topological structure is changed, for instance through a 
collapsed internal branch as in fig. |l(b)[ We model topologically intermediate 
trees as collapsed versions of trees with differing tree topology, and glue the 
portions of tree-space together along subspaces that correspond to collapsed 
trees, as in fig. |2j As a consequence, tree-space has self intersections and is not 
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Figure 2: Continuous transitions in tree topology: Tree-shapes with different 
tree topology live in different components of tree-space. Paths in different com- 
ponents can terminate in identical collapsed tree-shapes. To identify these, the 
tree-space components are glued together along subsets with the collapsed tree- 
shapes. 




^ X 

Figure 3: Tree-like shapes are encoded by an ordered binary tree and a set of 
attributes describing edge shape. 



smooth, but has singularities! 

The main theoretical contributions of this paper are the construction of 
a mathematical tree-shape framework along with a geometric analysis of two 
natural metrics on the shape space. One of these metrics is the classical Tree 
Edit Distance (TED), into which we gain new insight. The second metric, called 
Quotient Euclidean Distance (QED), is induced from a Euclidean metric. Using 



Gromov's approach to metric geometry 13 , we show that QED generically 
gives locally unique geodesies and means, whereas finding geodesies and means 
for TED is ill-posed even locally. We explain why using TED for computing 
average trees must always be accompanied by a carefully engineered choice of 
edit paths in order to give well-defined results; choices which can yield average 



trees which are substantially different from the trees in the dataset 37 . The 
QED approach, on the contrary, allows us to investigate statistical methods for 
tree-like structures which have previously not been possible, like different well- 
defined concepts of average tree. This is our motivation for studying the QED 
metric! 

The paper is organized as follows: In section we discuss related work. 
The tree-space is defined in section [2, and the statistical properties of tree- space 
are analyzed in section |3] In section 4j we discuss how to overcome the compu- 
tational complexity of both metrics, and present a simple QED approximation. 
In section |5) we illustrate the properties of QED by computing geodesies and 
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different types of average tree for synthetic planar data trees as well as by com- 
puting QED means for sets of 3D airway trees from human lungs. 



1.1 Related work 

Metrics on sets of tree-structured data have been studied by different research 
communities for the past 20 years. The best-known approach is perhaps Tree 
Edit Distance (TED), which has been used extensively for shape matching and 



recognition based on medial axes and shock graphs [19j[35,37 . TED and, more 
generally, graph edit distance (GED), are also popular in the pattern recognition 
community, and are still used for distance-based pattern recognition approaches 



to trees and graphs TH[28 . The TED and GED metrics will nearly always have 



infinitely many shortest edit paths, or geodesies, between two given trees, since 
edit operations can be performed in different orders and increments. As a result, 
even the problem of finding the average of two trees is not well posed. With 
no kind of uniqueness of geodesies, it becomes hard to meaningfully define and 
compute average shapes or modes of variation. This problem can be solved to 
some extent by choosing a preferred edit path 11,28,37 , but there will always 
be a risk that the choice has negative consequences in a given setting. Trinh 
and Kimia 37 face this problem when they use TED for computing average 
medial axes using the simplest possible edit paths, leading to average shapes 
which can be substantially different from most of the dataset shapes. 

Statistics on tree-shaped objects receive growing interest in the statistical 



community. Wang and Marron 41 study metric spaces of trees and define a 
notion of average tree called the median- mean as well as a version of PGA, 
which finds modes of variation in terms of tree-lines^ encoding the maximum 
amount of structural and attributal variation. Aydin et al. 1 extend this work 
by finding efficient algorithms for PGA. This is applied to analysis of brain 
blood vessels. The metric defined by Wang and Marron does not give a natural 
geodesic structure on the space of trees, as it places a large emphasis on the 
tree-topological structure of the trees. The metric has discontinuities in the 
sense that a sequence of trees with a shrinking branch will not converge to a 
tree that does not have that branch. Such a metric is not suitable for studying 
trees with continuous topological variations and noise, such as anatomical tree- 
structures extracted from medical images, since the emphasis on topology makes 
the metric sensitive to structural noise. 

A different approach is that of Jain and Obermayer 15 , who study metrics 
on attributed graphs, represented as incidence matrices. The space of graphs is 
defined as a quotient of the Euclidean space of incidence matrices by the group 
of vertex permutations. The graph- space inherits the Euclidean metric, giving it 
the structure of an orbifold. This graph-space construction is similar to the tree- 
space presented in this paper in the sense that both spaces are constructed as 
quotients of a Euclidean space. The graph-space framework does not, however, 
give continuous transitions in internal graph-topological structure, which leads 
to large differences between the geometries of the tree- and graph-spaces. 

Trees also appear in genetics. Hillis et al. 14 visualize large sets of phylo- 
genetic trees using multidimensional scaling. Billera et al. 3 have invented a 
phylogenetic tree- space suitable for geodesic analysis of phylogenetic trees, and 
Owen and Provan 27 have developed fast algorithms for computing geodesies 
in phylogenetic tree-space. Nye [26. has developed a notion of PGA in phyloge- 
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netic tree-space, but is forced to make strict assumptions on possible principle 
components being "simple lines" for the sake of computability. Phylogenetic 
trees are not geometric, and have fixed, labeled leaf sets, making the space of 
phylogenetic trees much simpler than the space of tree-like shapes. 

We have previously [9][To] studied geodesies between small tree-shapes in 
the same type of singular shape space as studied here, but most proofs have 
been left out. In 8 , we study different algorithms for computing average trees 
based on the QED metric. This paper extends and continues [o], giving proofs, 
in-depth explanations and more extensive examples illustrating the potential of 
the QED metric. 



2 The space of tree-like shapes 

Let us discuss which properties are desirable for a tree-shape model. As previ- 
ously discussed, we require, at the very least, local existence and uniqueness for 
geodesies in order to compute average trees and analyze variation in datasets. 
When geodesies exist, we want the topological structure of the intermediate 
trees along the geodesic to reflect the resemblance in structure of the trees be- 
ing compared. In particular, a geodesic passing through the trivial one-vertex 
tree should indicate that the trees being compared are maximally different. Per- 
haps more importantly, we would like to compare trees where the desired edge 



matching is inconsistent with tree topology, as in fig. 1(a) Specifically, we would 



like to find geodesic deformations in which the tree topology changes when we 



have such edge matchings, for instance as in fig. |l(b) 



2.1 Representation of trees 

In this paper we shall work with two different tree- spaces: A tree- space X, 
which is the space of all trees of a certain size, and a subspace Z C X, which is 



a restricted space of trees, whose exact definition is flexible (see definition 11). 
The large tree-space X is the natural space for geometric trees. However, the 
available mathematical tools only allow us to prove our results locally, where 
the locality assumptions become very strict in X. Using a set of natural as- 
sumptions, we can restrict to a tree-space Z where our results hold in larger 
regions of tree-space. This is discussed in detail in section [XB] We also believe 



that our results hold in X, as described in conjecture 41 

In this paper, a "tree-shape" is an embedded tree in or R^, and consists of 
a series of edge embeddings, glued together according to a rooted combinatorial 
tree. Tree-shapes are invariant to translation, but our definition of tree-shape 
does not remove scale and rotation. However, tree-shapes can always be aligned 
with respect to scale and rotation prior to comparison, if this is important. 

Any tree-like shape is represented as a pair (^,x) consisting of a rooted, 
planar tree £^ with edge attributes x. In ^ = (F, £^,r), V is the vertex set, 
E C V X V is the edge set, and r is the root vertex. The tree 3^ describes 
the tree-shape topology, and the attributes describe edge shape, as illustrated 
in fig. [sj The shape attributes, represented by a point x = (xg) in the prod- 
uct space rieG^; ^ concatenation of edgewise attributes from an attribute 
space A. The attributes could, e.g., be edge length, landmark points or edge 
parametrizations. In this work, we mostly use open curves translated to start 
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(a) (b) 



Figure 4: (a) Higher- degree vertices are represented by collapsing internal edges 
(dotted line = zero attribute = collapsed edge). We identify those tree pre- 
shapes whose collapsed structures are identical: The pre-shapes xi and X2 rep- 
resent the same tree-shape x. (b) The tree deformation shown in the top row 
does not correspond to a path in X, as the two representations of the interme- 
diate tree are found at distinct points in X. 



at the origin, described by a fixed number of landmark points along the edge. 
Thus, throughout the paper, the attribute space A is (R^)^ where d = 2 or 3 
and n is the number of landmark points per edge. Collapsed edges are repre- 
sented by a sequence of origin points. In some illustrations, we shall use scalar 
attributes for the sake of visualization, in which case A = R. 

In order to compare trees of different sizes and structures, we need to rep- 
resent them in a unified way. We describe all shapes using the same tree £^ to 
encode tree topology. By choosing a sufficiently large we can represent all 
the trees in our dataset by filling out with empty (collapsed) edges. We call ^ 
the maximal tree. 

We model tree-shapes using binary maximal trees Tree-shapes which are 
not binary are represented by the binary tree ^ in a natural way by allowing 
constant, or collapsed, edges, represented by the zero scalar or vector attribute. 
In this way an arbitrary attributed tree can be represented as an attributed bi- 
nary tree, see fig. |4(a)| This is geometrically very natural. Binary trees are 
geometrically stable in the sense that small perturbations of a binary tree-shape 
do not change the tree-topological structure of the shape. Conversely, a trifur- 
cation or higher-order vertex can always be turned into a series of bifurcations 
sitting close together by an arbitrarily small perturbation. In our represen- 
tation, thus, trifurcations are represented as two bifurcations sitting infinitely 
close together, etc. 

Trees embedded in the plane have a natural edge order induced by the left- 
right order on the children of any edge. We say that a tree is ordered whenever 
each set of sibling edges in the tree is endowed with such a total order. Con- 
versely, an ordered combinatorial tree always has a unique, implicit embedding 
in the plane where siblings are ordered from left to right. For this reason we use 
the terms "planar tree" and "ordered tree" interchangingly. We initially study 
metrics on the set of ordered binary trees; later we use them to induce dis- 
tances between unordered trees by considering all possible orders. This allows 
us to model trees in R^. Considering all orders leads to potential computational 
challenges, which are discussed in section [4] 

In order to build a space of tree-shapes, fix an ordered maximal binary tree 
£^ with edges which encodes the connectivity of all our trees. Any attributed 
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Figure 5: The simplest non-trivial tree-shape space, consisting of ordered trees 
with two edges with scalar attributes. Along the x- and ?/-axes are trees with 
a single branch. For each real number a, the tree-shape found at T' = (a, 0) 
is also represented at T = (0,a). We build the tree-shape space by gluing the 
different representations of the same tree-shapes (e.g., T and G X) together, 
obtaining the shape space shown on the right. Note the path from Ti to T2 
through T in X on the right; the corresponding path in the pre-shape space X 
involves a "teleportation" between the representations T and T' of T. 



tree T is now represented by a point x = {xe)eeE in X = neG£;(^^)^' where the 
coordinate Xe describes the shape of the edge e. Since we allow zero-attributed 
edges, as discussed above, some tree-shapes will be represented by several points 
in X (fig. |4(a)[ ). As a result, some natural tree-deformations are not found as 
continuous paths in X. In figs. 4(b) andjsj the paths in X corresponding to the 



indicated deformations require a "teleportation" between two representations 
of the intermediate tree-shape. We tackle this by using a refined tree-shape 
space X, where different representations are identified as being the same point. 
The original space X is called the tree pre-shape space, analogous to Kendall's 



terminology 17 



2.2 The singular space of ordered tree-shapes 

We go from pre-shapes to shapes by identifying those pre-shapes which define 
the same shape. 

Consider two ordered tree-shapes with collapsed edges. Replace their binary 
representations by collapsed representations, where the zero attributed edges 
have been removed. The orders of the original trees induce well-defined orders 
on the collapsed trees. We say that two ordered tree-shapes are the same when 
their collapsed ordered, attributed versions are identical, as in fig. 4(a) Tree 
identifications come with an inherent bijection of subsets of the edge set E: If 
we identify x^y e X = Yl^i^^)^^ denote by 

Ei = {ee E\xe ^ 0}, (2) 
E2 = {ee E\y, / 0}. (3) 

the sets of non-collapsed edges with non-zero attributes. The identification of 
X and y is equivalent to an order preserving bijection cp: Ei ^ £^2, identifying 
those edges that correspond to the same edge in the collapsed tree-shape. Vary- 
ing the attributes Xg, (p spans a family of tree-shapes with fixed topology and 
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several representatives. Thus, the edge sets Ei and E2 induce hnear subspaces 

IVi = {xe X\xe = if e ^ £;i} (4) 
V2 = {xeX\xe = 0ife^ E2} (5) 

of X where, except for on the axes, the topological tree structure is constant. 
The tree-shapes represented in Vi are exactly the same as those represented in 
V2. The bijection (p induces a bijection ^: Vi ^ V2 given by <l>: (xg) 1-^ (^(^(e))? 
which connects each representation x G Vi to the representation ^{x) G V2 of 
the same shape. Note that the Vi are spanned by axes in X. 

Define a map ^ for each pair of identified tree- structures, and form an equiv- 
alence on X by setting x ~ ^{x) for all x and <l>. For each x e X, x is the 
equivalence class {x^ e X\x^ ^ x}. The quotient space 

X = {X/ = {x\xe X} (6) 

of equivalence classes x is the space of ordered tree-like shapes. 

Quotient spaces are standard constructions from topology and geometry, 
where they are used to glue spaces together ^ chapter 1.5]. The geometric 
interpretation of the identification in the tree-space quotient is that we fold and 
glue the pre-shape space space along the identified subspaces; i.e., when xi ~ X2, 
we glue the two points xi and X2 together. See fig. [5] for an illustration. 



2.3 Metrics on the space of ordered trees 

Given a metric d on the Euclidean pre-shape-space X = Yl^^^O^^)^ ^ induce 
the standard quotient pseudometric [4j J on the quotient space X = X/ ~ by 
setting 

d{x,y) = inf |^ ex.yi^ Xi^i.yk G ^| . (7) 

This corresponds to finding the optimal path from x to ^, consisting of any 
number k of concatenated Euclidean lines, passing through k — 1 identified 
subspaces, as shown in fig. [5] It is clear from the definition that the distance 
function d is symmetric and transitive. It is, however, an infimum, giving a risk 
that the distance between two distinct tree-shapes is zero, as occurs with some 
intuitive shape distance functions 25 . This is why d is called a pseudoiaeiiic^ 
and it remains to prove that it actually is a metric; i.e., that d{x^ y) = ^ implies 
x = y. 

We prove this for two specific metrics on X, which come from two different 
ways of combining individual edge distances: The metrics di and d2 on X = 
Yleesi^^)^ are the norms 

||x-y||i = ^||Xe-ye||, (8) 

eeE 



\x-y\\2= . J2\\Xe-ye\\'. (9) 



eeE 



From now on, d and d will denote either the distance functions di and d 



or d2 and d2. We prove the following theorem in section 2.6 
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Theorem 10 The distance function d is a metric on X, which is a contractihle, 
complete, proper geodesic space. 

Thus, given any two trees, we can always find a geodesic between them in both 
metrics di and d2- Q 

We may often want to restrict to a subset of the large tree-space. 

Definition 11 (Restricted tree-shape space) Consider a subset Z C X, 
which only contains all representations of trees of certain restricted topologies, 
defined by collapsed subtrees of the maximal tree The i^^ collapsed subtree 
of 3^ is characterized by a subset Ei C E consisting of the edges in the maximal 
tree ^ which are not collapsed. Associated to it is a linear subspace Zi = 
YleeE cEO^^y C neG£;(^"^)^ ~ ^ Containing representations of all the trees of 
this particular topology. We include all representations of each tree topology, 
and obtain a restricted preshape space 

i eeE 

containing all the trees that have of one of the considered topologies. The 
equivalence relation on X restricts to an equivalence relation r^z on Z, from 
which we obtain a restricted tree-shape space Z = Z/ ^z= p{Z) C X. The 
metric d on X induces a metric dz on Z which induces a quotient pseudometric 
dz on Z. 

Example 12 Denote by Z the space of all trees in X with n leaves, now Z is 
the space of tree-shapes with n leaves. 

Remark 13 Note that the space of tree-shapes on n leaves in is different from 
the Billera- Holmes- Vogtmann (BHV) space 3 of trees with vector attributes, 
because in the BHV space, geodesies will always deform leaves onto leaves, 
whereas in Z, leaves can be transformed to non-leaf branches by a geodesic. 

Even in the restricted tree-shape space, we obtain an induced metric: 

Corollary 14 The pseudometric d^ on Z is a metric, and {Z^dz) a con- 
tractihle, complete, proper metric space. 

Proof. First, we show that the pseudometric is a metric. The pseudometric in 
equation ^ defines the distance y) as the infimum of lengths of paths in 
X connecting x and y. Any path in Z is also a path in X, so if dz{x^y) = 0, 
then d{x^ ^) = as well, so x = y. 

as in sec- 



The proofs of the other claims follow the proof of theorem 10 



tionEJ^ □ 



2.4 From ordered to unordered trees 

The world is not two-dimensional, and for most applications it is necessary to 
study embedded trees in R^. The main difference from the ordered case is that 
trees in have no canonical edge order. The left-right order on children of 
planar trees gives an implicit preference for edge matchings, and hence reduces 

■"^It can be shown that for any metric d on X, the induced pseudometric Jon X is a metric. 
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the number of possible matches. When we no longer have this preference, we 
consider all orderings of the same tree and choose orders which minimize the 
distance. 

We define the space of (unordered) tree-like shapes in 3D as the quotient 
X = X/G, where G is the group of reorderings of the maximal binary tree 
The metric J on X induces a quotient pseudometric d on X. Again, we can 
prove: 

Theorem 15 For d induced by either di or d2, the function d is a metric and 
the space (X, d) is a contractihle, complete, proper geodesic space. 

The same result holds for restricted tree-spaces with several restricted tree 
topologies: Let Z C X be a subspace containing only trees of certain restricted 
topologies, as in definition [771 '^hich is saturated with respect to the reordering 
group^ G. For the corresponding restricted tree-space Z = Z /G (Z X , the 
quotient pseudometric dz is a metric and the space (Z^dz) is a contractible, 
complete, proper geodesic space. 

Note that G is a finite group, which means that X is locally well-behaved 
almost everywhere. In particular, off fixed-points for the action of reorderings 
^ G G on X, the projection p: X ^ X is a local isometry, i.e., it is distance 
preserving within a neighborhood. Hence, the geometry from X is preserved off 
the fixed points. Geometrically, a fixed point in X is an ordered tree-shape where 
a reordering g of certain branches does not change the ordered tree-shape; that 
is, some pair of sibling edges must have the same shape attributes. In particular, 
the fixed points are non-generic because they belong to the lower-dimensional 
subset of X where two sibling edges have identical shape. Theorem 



15 



can 



be proved using standard results on compact transformation groups along with 



similar techniques as for theorem 10 



While considering all different possible orderings of the tree is easy from the 
point of view of geometric analysis, in reality this becomes a computationally 
impossible task when the size of the trees grow beyond a few generations. In 
real applications we can, however, efficiently reduce complexity using heuristics 
and approximations, as discussed in section [4| 

2.5 Geometric interpretation of the metrics 

It follows from the definition that the metrics di and di coincide with the classi- 
cal tree edit distance (TED) metric for ordered and unordered trees, respectively. 
In this way the abstract, geometric construction of tree- space gives a new way 
of viewing the intuitive TED algorithm. 

The metrics J2 and J2 are descents of the Euclidean metric on X, and 
geodesies in this metric are concatenations of straight lines in fiat regions. In 



section 3.7 we compare the two metrics using examples. 



2.6 Proof of theorem [To] 

We now pass to the proof of theorem [lO] The rest of the article is independent 
of this section, and during the first read, the impatient reader may skip to 



is saturated if, for each tree topology appearing in Z, all reorderings of the same tree 
topology also appear in Z. Equivalently, Z = p~^(p(Z)), where p: X ^ X/G 
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section [Sj However, while the proof is technical, theorem 10 is a fundamental 
building block for the shape space framework. We shall assume that the reader 
has a good knowledge of metric geometry or general topology [4 . It is crucial 
for the proof that we are only identifying subspaces of the Euclidean space X 
which are spanned by Euclidean axes, and these are finite in number. This 
induces a well-behaved projection p: X — > X, which carries many properties 
from X to X. 



2.6.1 Precise shape space definition 

We say that ordered tree-shapes whose (collapsed) ordered structure is the same, 
belong to the same combinatorial tree-shape type. For each combinatorial type 
of ordered tree-shape Cj (j = 1, . . . , K) which can be represented by collapsing 
edges in the maximal tree ^ = (V, £^,r, <), there is a family Ej of subsets 
of which induce that particular type Cj when we endow the edges in Ej 
(i = l,...,nj) with nonzero attributes and leave all other edges with zero 
attributes. These subsets are characterized by the properties 

a) the cardinality \Ej \ = \Ej\ for ah z = 1, . . . , n^, j = 1, . . . , K. 

b) there is a depth-first order on each Ej induced by the depth-first order 
on E^ such that the ordered, combinatorial structure defined by any Ej 
coincides with that defined by Ej. 

That is, the subset Ej for any i lists the set of edges in ^ which have nonzero 
attributes for the i^^ representation of any shape of type Cj. Corresponding to 
each Ej is the linear subspace of X given by 

Vj = I {xe) e l[ {R^r\xe = if e ^ £;j I (16) 

I eeE J 

and by condition b) we can define isometrics (j)) : V- Y\eGE^ forgetting 

J J 3 

the zero entries in V- and keeping the depth-first coordinate order. We generate 
the equivalence ~ on X by asking that z w whenever (l)]{z) = (l)^j{w) for 
some i, j, We now define the space of ordered tree-like shapes as the quotient 
X = X/ ~, and define the quotient map p: X ^ X. 



2.6.2 The pseudometric is a metric 

It is clear from the definition that the distance function d defined in ([7| is 
symmetric and satisfies the triangle inequality, which makes it a pseudometric. 

Proposition 17 Let d denote di or ^2- The pseudometric d is a metric on X. 

Proof. It suffices to show that di{x,y) = implies x = y {i = 1^ 2). Moreover, 
it is also easy to show that di{x,y) > d2{x,y) for any x,^ G X, so it suffices to 
show that ^2(^5 y) = ^ implies x = y. Hence, from now on, write d for d2^ and 
assume that d{x^ y) = ^ for two tree-shapes x and y. 
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Choose e > such that 



e ^ mm < 



Ikell >0, 
bell >0, 
\Xe - Xe\\ > 0, 
be -yell >0, 



y = iVe) e y 



(18) 



that is, e is smaher than the size of any of the non-zero edges in x and y. 

We may assume that x^y G [j^^ jVJ since otherwise we may assume by sym- 
metry that X = {x} and d{x^ y) > mm{d{x^ y), d{x^ [j- ■ V-)} > 0. 

Denote by Xj the image of Vj under the quotient projection p: X ^ X for 
any i. 

We may assume that x and y belong to the same identified subspace; that 
is, there exist i^j such that 



xnvi ^9, ynvj ^ 



since otherwise. 



^,xny/7^0}) =0 for all 



Since y is a finite set, and IJ{^*k 7^ ^} is a closed set, (20) implies 
d{y,\J{Vi\xnVi^^)>0. 



(19) 



(20) 



(21) 



In this case, the path wih have to go through some V~ which does not contain 



points equivalent to y, and 

d{y,\J{Vi\ynVi = n)>e, 



(22) 



since in order to reach |J{^*I^ ~ need to remove edge attributes 

which are nonzero in ^, and e <C ||^e|| for all ye ^ 0. Thus, eq. 19] holds, and in 
fact, it holds for all the intermediate path points Xi from eq. [7 

But if the path points stay in Xj, then the path consists of shifting and 
changing the nonzero edge attributes of the trees in question. This will only 
give a sum < e if the trees are identical and the path is constant. □ 



2.6.3 Topology of the space of tree-like shapes 



Here, we prove the rest of theorem 10, namely that the tree-shape space {X,d) 
is a complete, proper geodesic space, and X is contractible. First, we note 
that although X is not a vector space, there is a well-defined notion of size for 
elements of X, induced by the norm on X: 



Lemma 23 Note that if x ^ y, we must have \\x\\ = \\y\\; hence we can define 
\\x\\ := \\x\\. 

Proof. The equivalence is generated recursively from the conditions x ^ y 
whenever either x = y, indicating \\x\\ = \\y\\; or (j)'j{x) = (l)\{y)^ indicating 
1 1^1 1 — 1 1 (^) 1 1 — ll^fc(^)ll — \\y\\ since the (j) are isometrics. Hence, the lemma 



holds by recursion. 



□ 
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We will prove that (X, d) is a proper geodesic space using the Hopf-Rinow 
theorem for metric spaces 4 , which states that every complete locally compact 
length space is a proper geodesic space. A length space is a metric space in 
which the distance between two points can always be realized as the infimum 
of lengths of paths joining the two points. Note that this is a weaker property 
than being a geodesic space, as the geodesic joining two points does not have to 
exist; it is enough to have paths that are arbitrarily close to being a geodesic. 
It follows from 4, chapter I lemma 5.20] that (X, J) is a length space for any 
metric d on X where J is a metric. 

To see that the tree-shape space is locally compact, note that the projection 
p: X ^ X is finite-to-one, so any open subset U of X has as pre-image a 
finite union UiLi of open subsets of X, such that didim{Ui) = diam(/7) and 
p(|J- Ui) = U is compact whenever U is bounded. 

We also need to prove that (X, d) is complete: 

Proposition 24 Let d denote either of the metrics di and d2- The shape space 
(X, J) is complete. 

The proof needs a lemma from general topology: 

Lemma 25 7, chapter XIV, theorem 2.3] Let (Y^d) be a metric space and 
assume that the metric d has the following property: There exists e > such 
that for all y eY the closed ball B{y, e) is compact. Then d is complete. 

Using the projection p: X ^ X, we can prove: 
Lemma 26 Bounded closed subsets of X are compact. 



Proof. Since lemma 23 defines a notion of size in X, any closed, bounded 
subspace C in X is contained in a closed ball 5j(0, R) in X for some > 0, 
where is the image p(0) G X. Since ||x|| = ||x||, it follows that 5 j(0, i?)) = 
Bd{0, R), which is a closed and bounded ball in X. Since closed, bounded subsets 
of X are compact 5^(0, R) is compact. By continuity of p, 5j(0, R)) is compact. 
Then C is compact too. □ 



It is now very easy to prove proposition 24 



Proof (Proof of proposition 24). By lemma 26, all closed and bounded sub- 



sets of X are compact, but then by lemma 25 the metric d must be complete. □ 



Using the Hopf-Rinow theorem 4, chapter I, proposition 3.7] we thus prove 
that (X, d) is a complete, proper geodesic space. We still miss contractibility: 

Lemma 27 Let B be a normed vector space and let ~ be an equivalence on B 
such that a ^ b implies t- a ^ t-b for alltGM. Then B = B / ^ is contractible. 

Proof. Define a map : 5 x [0, 1] ^ 5 by setting H{x^t) = t ■ x. Now H is 
well defined because of the condition on ~, and H{x^O) = V x G 5 so is a 
homotopy from id^ to the constant zero map. □ 



Combining the results of section [2^ we see that the proof of theorem [TO 
complete. 



IS 
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3 Curvature in the tree-shape space 



Having proved theorem [Toj we may now pass to studying the geometry of the 
tree-shape space through its geodesies. Uniqueness of geodesies and means is 
closely connected to the geometric notion of curvature^ a concept which fun- 
damentally depends on the underlying metric. Using methods from metric ge- 
ometry [4, 13 we shall investigate the curvature of the tree-shape space with 



the QED and TED metrics. Using curvature, we obtain well-posed statistical 
methods for QED. 

The next theorem states that in the tree-shape space endowed with the QED 
metric, any randomly selected point has a corresponding neighborhood within 
which the tree-space has non-positive curvature. We shall use this fact to show 
that datasets within that same neighborhood have unique averages. 

Theorem 28 i) Endow X with the QED metric d2- A generic point x e X 
has a neighborhood U C X in which the curvature is non-positive. At 
non-generic points, the curvature of {X,d2) is -\-oo, or unbounded from 
above. 

a) Endow X with the TED metric di. The metric space {X^di) does not 
have locally unique geodesies anywhere, and the curvature of {X^di) is 
+00 everywhere. 

Claims i) and ii) also hold in the subspace Z (Z X containing only trees of 
certain restricted topologies, as defined in definition [77} 

Here, local uniqueness is defined as uniqueness within a sufficiently small neigh- 
borhood. 



3.1 Genericity 

Genericity is a key concept in this paper. Many of our results, e.g., uniqueness 
of means, do not hold in general, but they do hold for a randomly chosen dataset 
with respect to natural probability measures. 

Definition 29 (Generic property) A generic property in a metric space (X, d) 
is a property which holds on an open, dense subset of X. 

In the tree-spaces X, X, Z and Z, one interpretation is that generic proper- 
ties hold almost surely, or with probability one, with respect to natural probabil- 
ity measures. Thus, for a random tree-shape, we can safely assume that it satis- 
fies generic properties, e.g., that the tree-shape is a binary tree. A non-generic 
property is a property whose "not happening" is generic. This is similarly in- 
terpreted as a property that may not hold for randomly selected tree-shapes. A 
detailed discussion of the relation between genericity and probability is found 
in Appendix A. 

One common misconception is that the term "generic tree" refers to a par- 
ticular class of trees with a particular generic property. It is important to note 
that many different properties, which do not necessarily happen on the same 
subsets of tree-space, may all be generic at the same time. However, any finite 
set of generic properties will all happen on an open, dense subset. 

Proposition 30 Tree-shapes that are truly binary (i.e. their internal edges are 
not collapsed) are generic in the space of all tree-like shapes. 
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geodesic 1 

(internal structural 
change -- side 
branches pass 
each other) 



geodesic 2 

(external structural 
change -- side 
branches disappear) 




Figure 6: Examples of geodesic deformations: geodesic 1 goes through a tree 
with a trifurcation, while geodesic 2 does not have internal structural transitions. 



Proof. Let T be a tree-shape in X or X which is not truly binary, represented 
by a maximal binary tree 3^ . By adding arbitrarily small noise to the zero 
attributes on edges of 3^ ^ we obtain truly binary tree-shapes T' which are ar- 
bitrarily close to T. Thus, the set of full truly binary tree-shapes in X or X is 
open and dense. Hence, truly binary tree-shapes are generic both in X and in 
X. □ 

The essence of the proposition is that binary tree-shapes are generic, but 
that does not mean that non-binary trees do not need to be considered! While 
non-binary trees may not appear as randomly selected trees, they do appear in 
geodesies between randomly selected pairs of trees, as in fig.[6j Non-binary tree- 
like shapes also appear as samples in real-life applications, e.g., when studying 
airway trees. However, we interpret this as an artifact of resolution rather than 
true higher-degree vertices. For instance, airway extraction algorithms record 
trifurcations when the lengths of internal edges are below certain threshold 
values. 



3.2 Curvature in metric spaces 

In order to understand and prove theorem [28| we need a definition of curvature 
in metric spaces. In spite of its simplicity and elegance, this concept from metric 
geometry is novel in computer vision. We shall spend a little time introducing 



it before proceeding to prove theorem 28 in section 3.3 



Since general metric spaces can have all kinds of anomalies, the concept of 
curvature in such spaces is defined through a comparison with spaces that are 
well understood. More precisely, the metric spaces are studied using geodesic 
triangles, which are compared with corresponding comparison triangles in model 
spaces with a fixed curvature The model spaces are spheres (/^ > 0), the plane 

{hz = 0) and hyperbolic spaces {hz < 0), and through comparison with these 
spaces, we can bound the curvature of the metric space by hi. In this paper we 
shall use comparison with planar triangles, which gives us curvature bounded 
from above by 0. 

Given a geodesic metric space F, a geodesic triangle abc in Y consists of 
three points a, 6, c and geodesic segments joining them. A planar comparison 
triangle abc for the triangle abc consists of three points a, b, c in the plane, 
such that the lengths of the sides in abc are the same as the lengths of the sides 
in abc, see fig. [?[ 
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Figure 7: A metric space is a CAT{0) space if, for a geodesic triangle abc and 
for any point x on the triangle, the distance from x to the opposite vertex is not 
longer than the corresponding distance in the planar comparison triangle abc. 

A CAT{0) space is a metric space in which geodesic triangles are "thinner" 
than for their comparison triangles in the plane. That is, d{x^a) < ||x — a|| 
for any x on the edge be where x is the unique point on the edge be such that 
d{b^x) — ||b — x|| and d{x,c) = ||x — c||. If the planar comparison triangle 
is replaced by a comparison triangle in the sphere or hyperbolic space of fixed 
curvature hz^ we get a CAT{hz) space. 

A space Y has non-positive curvature if it is locally CAT(O), i.e., if any point 
X G Y has a radius r such that the ball B{x^r) is CAT(O). Similarly, define 
curvature bounded by n as being locally CAT{n). 

Example 31 a) The space U obtained by gluing a family Euclidean spaces 
Ui together along isomorphic affine subspaces Vi C [/^ is a CAT(O) space. 
At any point in U which is not a glued point, the local curvature is 0, 
since the space is locally isomorphic to the corresponding Ui. At any 
glued point, it can be shown that the local curvature is — oo. 



b) The GPCA construction by Vidal et al. [40] defines a CAT(O) space, 
giving a potential use of CAT(O) spaces and metric geometry in machine 
learning. 

c) The space of phylogenetic trees is a CAT{^) space [3^. 

d) As we are about to see, the space of tree-like shapes is locally a CAT{^) 
space almost everywhere. 

One of the main reasons why CAT{n) spaces (and CATifi) in particular) 
are attractive, is due to the following result on existence and uniqueness of 
geodesies. 

Proposition 32 Proposition II 1.4] Let (F, d) he a CAT{k) space. Ifi^<0, 
then all pairs of points have a unique geodesic joining them. For k > 0, the same 
holds for pairs of points at a distance less than iij^/n. 

More results on curvature in metric spaces can be found in the book by 
Bridson and Haefliger [4]. 

3.3 Curvature in the space of ordered tree-shapes - proof 
of theorem [28l 

In this section we study the curvature of tree-shape space using the theory of 
CAT(^) spaces. We show that at a generic tree, the shape space has bounded 
curvature. The results rest on the following theorem: 
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Theorem 33 At a generic point x G X , the shape space is locally CAT{0), and 
thus, X has locally unique geodesies in a neighborhood of x. 

Proof. Recall from section |2.2| how X was formed by identifying subspaces 
Vi C X^ defined in eq. |4] and [Sj These identified subspaces corresponded to 
different representations in X of the same shape x G X. The points in X can 
now be divided into three categories: 

i) points xi e X which do not belong to the image of an identified subspace 
because they only have one representative in X, 

and two classes of points in X which have more than one representative in X. 

ii) The first class contains points X2 G X at which the space X is locally 



homeomorphic to an intersection of linear spaces, as in example 31 a). 
These points are images of points X2 G X which belong to one single 
identified subspace Vi. 

iii) The second class contains points G X whose preimages xs in X are at 
the intersection of identified subspaces Vi^Vj C X. An example of such 
points is the image of the origin in fig. |5] These points correspond to trees 
where, infinitely close to the same tree, we can find pairs of trees in X 
between which geodesies are not unique. 

These three classes of points correspond to local curvature 0, — oc and +00. 
That is, the space is locally CAT{0) at points in categories i); at points from 
ii) it is CAT{hz) for every G M, so has curvature —00; and at points from iii) 
it is not CAT{hz) for any G M; hence the curvature is +00. It thus suffices to 
show that the points in category iii) are non-generic, which follows easily from 
the fact that these must necessarily sit in a lower-dimensional subspace of X. 
The proof carries over to the subspace Z. □ 

Definition 34 (Injectivity neighborhood) We call a CAT{0) neighborhood 
of a point x G X an injectivity neighborhood of x. 

Based on the above, we are now ready to prove: 



Proof (Proof of theorem [28]) . i) The QED case: Since X is locally CAT{0) 
at generic points x, the curvature of X is non-positive in a neighborhood 
U of X. At points x G however, we will always find pairs of points a, b 
arbitrarily close to x with two geodesies joining them, just as in fig. |5] 

ii) The TED case: Consider a tree-shape T G X, represented by a point 
X G X. Induce a second tree-shape T' represented hy x -\- yi -\- y2 G 
X, where ^1,^2 G neG£;(^^)^ such that yi and y2 have one non-zero 
coordinate, found in different edges, which are both nonzero edges in x. 
The topology of is the same as of T. For any n G N, we can find 
n TED geodesies ^1, . . . from T to T', where gi can be decomposed 
diS X X -\- {i/n)yi x -\- {i/n)yi + ^2 ^ ^ + ^1+^2- Thus, there 
are infinitely many TED geodesies from T to T', and (X,Ji) does not 
have locally unique geodesies anywhere. As a consequence, its curvature 
is unbounded everywhere 4, proposition II 1.4]. 
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The practical meaning of theorem 28 is that a) we can use techniques from 



metric geometry to search for QED averages, b) as we are about to see, for 
datasets contained in an injectivity neighborhood, there exist unique means, 
centroids and circumcenters for the QED metric, and c) the same techniques 
cannot be used to prove existence or uniqueness of prototypes for the TED 
metric, even if they were to exist. In fact, any geometric method which requires 



bounded curvature [3j[4 16 wih fail for the TED metric. This motivates our 
study of the QED metric. 

3.4 Curvature in the space of unordered tree-shapes 

It is easy to prove that the same results also hold for unordered tree-shapes: 

Theorem 35 The space (X, J2) of unordered trees with the QED metric is 
generically non-positively curved. With the TED metric di, however, X has 
everywhere unbounded curvature, geodesies are nowhere locally unique and nei- 
ther are any of the types of average tree discussed in this paper. The same holds 



in the subspace Z C X, defined in theorem 15 



3.5 Means and related statistics for tree-hke shapes 

In this section we use what we learned in the previous section to show that, 
given the QED metric on a space of tree-like shapes, we can find various forms 
of average shape in the space of ordered tree-like shapes, assuming that the data 
lie within an injectivity neighborhood. 

There are many competing ways of defining central elements given a subset 
of a metric space. We discuss several: the circumcenter considered in the 
centroid considered, among other places, in 3 , and the mean [I6]. 

The problem of existence and uniqueness of averages can be attacked using 
convex functions. Recall that a function /: [a, 6] ^ R is convex if /((I — s)t + 
St') < (1 - s)f{t) + sf{t') for ah s G [0, 1] and t,t' G [a, 6]. If we can replace 
< with < whenever s g]0, 1[, then / is strictly convex. Convex functions have 
minimizers, which are unique for strictly convex functions. Hence, existence 
and uniqueness of averages can be proven by expressing them as minimizers of 
strictly convex functions. 

We say that a function / : X ^ R on a geodesic metric space X is (strictly) 
convex if for any two points x^y G X and any geodesic 7: [0,/] X from x 
to the function / o 7 is (strictly) convex. We shall make use of the following 
standard properties of convex functions: 

Lemma 36 ///: R ^ R and ^: R ^ R are both convex, g is monotonous 

and increasing, and g is strictly convex, then g o f is strictly convex. 

a) // / : R ^ R and ^: R ^ R are both convex, then ^ + / : R ^ R is also 
convex. If either f or g is strictly convex, then g -\- f is strictly convex as 
well. 

The mean of a finite subset {xi, . . .Xs} in a metric space (X, d) is defined 
as in eq. [l] and is called the Frechet mean. Local minimizers of eq. [T] are called 
Karcher means. 
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The following result follows from a more general theorem by Sturm 34 
Proposition 4.3]; we include the basic version of the proof here for completeness. 



Theorem 37 Means exist and are unique in C AT {0)- spaces. 

Proof. The function dy'.Y^R given by dy{x) = d{x^y) is convex for any 
fixed 7/ G F by [ij Proposition II. 2. 2], so the function dy is strictly convex by 
lemma 36 i). But then D = ^^^i d^. is strictly convex by lemma 36 ii)^ and a 
mean is just a minimizer of the function D. The function D is coercive, so the 
minimizer exists. Since D is strictly convex, the minimizer is unique. □ 

We also consider two other types of statistical "prototypes" for a dataset, 
namely circumcenters and centroids. These are both well-known in the context 
of metric geometry and CAT{i<i) spaces. 

Definition 38 a) Circumcenters. Consider a metric space (F, d) and a 
bounded subset Z cY. There exists a unique smallest closed ball B{cz^ rz) 
in Y which contains Z; the center cz of this ball is the circumcenter of Z . 

b) The centroid of a finite set. Let X be a uniquely geodesic metric space 
(a metric space where any two points are joined by a unique geodesic). 
The centroid of a set 5 C X of n elements is defined recursively as a 
function of the centroids of subsets with n — 1 elements as follows: Denote 
the elements of by 5i, . . . , s^. If 5* contains two elements, l^l = 2, the 
centroid c{S) of S is the midpoint of the geodesic joining si and S2. If 
l^l = n, define c^{S) = {c{S') : = n - 1}, which is a set with n 
elements. Similarly, for larger /c, c^{S) = c^{c^~^{S)). All these sets have 
n elements. If the elements of c^{S) converge to a point c G X as /c ^ oo, 
then we say that c = c{S) is the centroid of 5* in X. 

Based on the theory of CAT{f<i) spaces and our results for means, we have 
for the set of tree-like shapes: 

Theorem 39 Endow X with the QED metric d2. A generic point x G X has 
a neighborhood U such that sets contained in U have unique means, centroids 
and circumcenters. 

The same statistical properties also hold for the QED metric on unordered 
tree-shapes:_ Generic points in the space of unordered tree-shapes with the QED 
metric (X, 6^2) have neighborhoods within which means, circumcenters and cen- 
troids exist and are unique. 

For the TED metric, these are not unique. 

The same results hold in the restricted tree-spaces Z and Z, defined in defi- 
nition{Tl\ and theorem[TEl 
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and 
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Proof. First consider X and X with the QED metric. By theorems 
X and X are both locally CAT{0) spaces, and by theorems 10 and ISlthey are 
both complete metric spaces. 



We have seen in theorem 37 that means exist and are unique in CAT{0) 
spaces, so the statement holds for means. By 4, proposition 2.7], any subset 
F of a complete CAT{0) space has a unique circumcenter. Hence, the state- 
ment holds for circumcenters. Similarly, by ^ theorem 4.1], finite subsets of 
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Figure 8: Two trees joined by two different geodesies in X. 



CAT{{)) spaces X have centroids (unique by definition), so the statement holds 
for centroids. 

We turn to the TED metric. By definition, for any 2-point dataset, ah these 
notions of mean reduce to finding the midpoint of a geodesic connecting the 
two points. We know that geodesies and midpoints are not unique in the TED 
metric. This ends the proof. □ 

3.6 The injectivity neighborhood 

We have shown that the local curvature is nonpositive almost everywhere in 
X, which makes X well suited for geometric definitions of statistical proper- 
ties. However, our notion of "almost everywhere" is strongly tied to (maximal) 
dimensionality, which again is strongly tied to the topological structure of the 
maximal tree 3^ . One consequence is that the injectivity neighborhoods in X 
are rather small, as we are about to see through examples. In this section we 
impose natural constraints on tree-space that allow us to increase the size of the 
injectivity neighborhoods, and make a conjecture for future expansion beyond 
the use of CAT(0)-spaces. 

Consider the following two examples; we thank the anonymous reviewer for 
the first! 

Example 40 1) Consider the tree-shapes Ti and T2 shown in fig. [sj left. 
These two tree-shapes are joined by two geodesies, and thus Ti and T2 are 
not contained in the same injectivity neighborhood in X. However, in a 
suitably chosen Z, they can be. 

2) Consider the tree-shape T in the space of unordered tree-shapes with 
attributes in R^, spanned by the maximal tree ^ as in fig. [9j right. Ar- 
bitrarily close to T we will find two trees Ti and T2 which are joined by 
two geodesies, as shown in the figure. 

As a illustrated by these examples, the CATifi) injectivity neighborhoods in 
the shape-space X can be very small, mainly containing trees whose topology is 
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the same as 3^ . However, we can obtain much larger injectivity neighborhoods 
by restricting to the natural subspaces Z (Z X and Z C X as in definition [iT] As 
shown in theorem 39 , 2 and 2 have the same nice geometric properties as X and 
X, and Z can be chosen so that the injectivity neighborhoods are bigger than 
in X and X by avoiding situations as in the examples above. Radius is not a 
good measure for the size of an injectivity neighborhood, as a tree may contain 
both small branches, which do not have much room to vary, as well as large 
branches, which will be allowed to move more throughout such a neighborhood. 
However, any convex neighborhood which does not contain points of curvature 
+00, will be an injectivity neighborhood. 

All the results from section [33] hold at generic points, within an injectivity 
neighborhood where the CAT(^) property holds. We have seen examples of 
points where local CAT(0)-property must fail, illustrating the limitaitons of 
CAT(O) techniques for analyzing general trees. However, most of the situations 
where the CAT(O) property fails are highly non-generic, and we conjecture that 
more general results can be proven: 

Conjecture 41 For a generic set of points X ^ X , Z or Z , means 

exist and are unique. 



3.7 Comparison of QED and TED 

As we have seen in theorems [28j the QED metric gives locally non-positive 
curvature at generic points, while the TED metric gives unbounded curvature 
everywhere on X, X, Z and 2. Equivalently, geodesies are locally unique almost 
everywhere in the QED metric, while nowhere locally unique in the TED metric. 
As emphasized by theorem [39| this means that we cannot imitate the classical 
statistical procedures on shape spaces using the TED metric, while for the QED 
metric, we can. 

Note, moreover, that the QED metric is the quotient metric induced from 
the Euclidean metric on the pre-shape space X, making it the natural choice of 
metric seen from the shape space point of view. 

From a computational point of view, the TED metric has nice local-to- 
global properties. If the trees Ti and T2 are decomposed into subtrees Ti^i,Ti^2 



21 




Figure 10: (a) Local-to-global properties of TED: di(Ti,T2) = t/i(Ti,i, T2,i) + 
c^i(^i,2 7 ^2,2)- (b) Two options for structural transition in a path from Tree 1 
to Tree 2. ' 



and T2,i,T2,2 as in fig. 10(a), such that the geodesic from Ti to T2 restricts 
to geodesies between Ti^i and T2,i as well as Ti^2 and T2,2, then d{Ti^T2) = 
(i(Ti^i, T2,i) + <i(^i,2 7 ^2,2)- Many dynamic programming algorithms for TED 
use this property, and the same does not hold for the QED metric, due to the 
square root. 

For a qualitative comparison of QED and TED, we compare the geodesies 
defined by the TED and QED metrics between small, simple trees. Consider the 
two tree-paths in fig. |10(b)[ where the edges are endowed with scalar attributes 
a, 6, c, e G M+ describing edge length. Computing the costs of the two different 
paths in both metrics, we find the shortest (geodesic) path. 

Path 1 indicates a matching left and right side edges c and while Path 2 
does not make the match. The cost of Path 1 is 2e in both metrics, while the 
cost of Path 2 is 2Vc^ + (P in the QED metric and 2(c + d) in the TED metric. 
Thus, TED will identify the c and d edges whenever < + 2cd + d^, while 
QED makes the identification whenever < \{c^ d^). Thus, TED will be 
more prone to internal structural change than QED. 

The same occurs in the comparison of TED and QED matching in figs. |ll(a)| 
and |ll(b)| Although the TED is more prone to matching trees with different 
tree-topological structures, the edge matching results are similar, as is expected, 
since the metrics are closely related. 



4 Computation and complexity 

In general, computational complexity is a problem for both TED and QED. 
Computing TED distances between unordered trees is NP-complete 43 , and 
we conjecture that the QED metric is NP-complete as well. We can, however, 
often use geometry and prior knowledge (e.g., anatomy) to find efficient approx- 
imations. Trees appearing in applications are often not completely unordered, 
hut are semi-labeled. 

Example 42 (Semi- labeling of the upper airway tree) Most airway trees 
have similar, but not necessarily identical, topological structure, where several 
branches have names and can be identified by experts. 
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(a) Matching in the QED metric. 
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(b) Matching in the TED metric. 



Figure 11: Given a set of five data trees, we match each to the four others in 
both metrics. 
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The top generations of the airway tree serve very clear purposes in terms of 
anatomy. The root edge is the trachea; the second generation edges are the left 
and right main bronchi; the third generation edges lead to the lung lobes. As 
these are easily identified, we find a semi-labeling of the airway tree, and use it 
to simplify computations of inter-airway geodesies in section |5.2[ 



4.1 QED approximation and implementation 

Since the decomposition strategy used for dynamic programming in TED is not 
available for QED, an approximation of QED is used in our experiments. Many 
anatomical trees have a somewhat fixed overall structure. For these, it is safe 
to assume that the number of internal structural transitions found in a geodesic 
deformation is low, and that the geodesies pass through identified subspaces of 
low codimension. The latter assumption is equivalent to assuming that the trees 
appearing throughout the geodesic deformation have nodes of low degree. For 
instance, for the airway trees studied in section [5^ below, we find empirically 
that it is enough to allow for one structural change in each lobar subtree, which 
has at most a trifurcation. Recall the definition of the metric from eq. [7j the 
approximation consists of imposing upper bounds K on the number k and D 
on the degrees of internal vertices appearing in eq. [7| respectively, giving: 

dapprox,K{x,y) = inf <^ ^d{xi,yi)\xi ex.yi^ ^i+i,y/c \ , (43) 

k<K ' ^ I 

- U=i ) 

where all Xi and yi have vertex degrees at most D. Geometrically, k is the num- 
ber of Euclidean segments concatenated to form the geodesic. Bounding k is 
equivalent to bounding the number of internal topological transitions through- 
out the geodesic. In fact. A: = 1 + the number of internal topological transitions 
in the geodesic. 

All edges are translated to start at G M^, and represented by a fixed number 
of landmark points (in our case 6) evenly distributed along the edge, the first 
at the origin. The distance between two edge attributes vi,V2 G (M.^)^ is the 
Euclidean distance \\vi — V2\\. 

We approximate of the QED metric using Algorithm [ij see Appendix B in 
the supplemental material for details. The complexity of Algorithm 1 is 

0{n^D-^^^^-^^) • 0{optim{n,K)), 

where n is the number of internal vertices, K is the bound on /c, D is the 
bound on vertex degree and optim{n^ K) is the optimization in line 8. For the 
unordered airway trees in section [5^ we combine Algorithm 1 with a complete 
search through the set of branch orderings. Computing QED distances through 
a complete search is not optimal, and finding more exact and efficient algorithms 
is a nontrivial research problem. 



5 Experimental results 

The QED metric is new, whereas the properties of the TED metric are well 



known 31 . Our experimental results on real and synthetic data illustrate the 



geometric properties of the QED metric. The experiments on airway trees in 
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Algorithm 1 Computing approximate QED distances between ordered, rooted 
trees with up to k = K — 1 structural transitions through trees of order at most 

D 

1: X, y planar rooted depth n binary trees 

2: S = {S} set of ordered identified pairs S = {81,82} of subspaces of X 
corresponding to internal topological changes through trees of order at most 
corresponding to a subspace S of X, s.t. if {81,82} G S, then also 
{S2,Si}eS. 

3: for S = {S\ ...,S'}CS with 1^1 <kdo 

4: for e with representatives p\ G SI and G ^2 

5: p= {p^,p^,...,p'') 

6: f{p)=min{d2{x,pl) + j:'^Zld2{pi,p{^')+d2{phy)} 
7: end for 

8: d^ = mm< f{p) ^ ^ 



9: 
10: 
11: 
12: 
13: 
14: 



8 = {8\...,8^} j 

Ps = {Pi^P2}i=i = argmin/(p) 
end for 

^^ = min{d^|^cS, 1^1 <k} 
P = {Pi,P2}Li = {Ps\ds = d} 

geodesic = g = {x ^ p\ ^ pi ^ Pi ^ P2 ^ • - • ^ Pi ^ P2 ^ v} 
return d, g 



section [5^ show, in particular, that it is feasible to compute geodesies between 



real, 3D data trees. 

5.1 Synthetic planar trees of depth 3 

Movies illustrating geodesies between planar depth 3 trees, as well as a table 
illustrating a tree-shape matching experiment for a set of synthetic planar depth 
3 trees, are found on the web page http : / / image . diku . dk/ aasa/tr ee_shape7' 
^planar . htm l , The movies and matchings show the geometrically intuitive be- 
havior of the geodesic deformations. We see that the intermediate structures 
resemble the geodesic endpoint trees in a reasonable manner, and show the abil- 
ity of the geodesies to handle internal topological differences. These are among 
the wanted properties from the model listed in the beginning of section [2j 

Moreover, computed mean and centroid trees are shown in fig. [12] these are 
presented in greater detail in 8 . The QED average trees clearly represent the 
main common properties of the dataset trees. 

5.2 Results in 3D: Pulmonary airway trees 

As proof-of-concept experiments on real data we compute means for sets of pul- 
monary airway trees from the EXACT'09 airway segmentation competition [ 22] . 
The airway trees were first segmented from low dose screening computed to- 
mography (CT) scans of the chest using a voxel classification based airway 
tree segmentation algorithm [2l]. The centerlines were extracted from the seg- 



mented airway trees using a modified fast marching algorithm based on 30 



The method gives a tree structure directly through connectivity of parent and 
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Figure 12: Top: A small set of synthetic planar trees. Bottom left: The mean 
tree. Bottom right: The centroid tree. 




Figure 13: Subtrees of 14 airway trees from the EXACT'09 test set. 22 



children branches. For simplicity, we only consider the upper airway tree down 
to the lobe branches. 

Means are computed using both the QED metric (details found in ^) and 
using the TED method proposed by Trinh and Kimia [37]. The dataset is shown 



in fig. [13) and the results are seen in fig. 14 We clearly see how the choice of 



TED geodesic makes the TED mean vulnerable to noise in the form of missing 
branches among the dataset trees, giving a mean shape whose structure is too 
simple. The QED mean, on the other hand, represents the data well. 



6 Discussion and conclusion 

Starting from a purely geometric point of view, we define a shape space for tree- 
like shapes. This intuitive geometric framework allows us to simultaneously take 
both global tree-topological structure and local edgewise geometry into account. 
We study two metrics on this shape space, TED and QED, which both give 



the shape space a geodesic structure (theorems 10 and 15). The framework is 
developed for tree-shapes, but carries over to other geometric trees with vector 
attributes. 

QED is the geometrically most natural metric, and turns out to have prop- 



QED TED 




Figure 14: The QED and TED mean airway trees. The TED mean misses the 
upper lobe branches on both sides, caused by the fact that the 13th data tree 
misses these. 
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erties which are essential for statistical shape analysis. Through a geometric 
analysis of the tree-shape framework, we show that the QED metric gives local 
uniqueness of geodesies and local existence and uniqueness for three versions 
of average shape, namely the mean, the circumcentre and the centroid (theo- 
rem 39). Our analysis gives new insight into the metric space defined by TED, 



and explains why the problem of finding TED-averages is ill-posed. 

Both metrics are generally NP complete to compute for 3D trees. We ex- 
plain how semi-labeling schemes can be used to handle complexity problems, 
and illustrate this by computing QED means for sets of trees extracted from 
pulmonary airway trees as well as synthetic planar data trees. The QED lacks 
an obvious decomposition strategy for dynamic programming, and involves an 
optimization term which makes it more time consuming to compute than the 
TED metric. The two metrics have similar matching properties, and thus, for 
applications which do not require a unique geodesic (e.g., clustering) the TED 
metric is just as suitable as the QED metric. However, for statistical computa- 
tions, the QED metric is the only suitable metric. This is emphasized by our 
experiments, where computed QED and TED means illustrate that the TED 
means are missing important structure. 

Future research will be centered around two points: Development of non- 
linear statistical methods for the singular tree-shape spaces, and finding more 
efficient approximations and heuristics for the QED metric using both the tree 
geometry and the tree-space geometry. The latter will pave the road for com- 
puting averages and modes of variation for large, real 3D data trees. This is by 
no means trivial, due to the non-smooth structure of the tree-shape space and 
the complexity of computing exact distances and geodesies. 



Appendices 

A Measures induced on quotient spaces 

The purpose of this appendix is to explain the connection between genericity, 
defined as a topological property, and probability. In particular, we explain 
why a generic property can be assumed to hold with probability one, or almost 
everywhere, with respect to natural probability measures. The main point of 
the appendix is the contents of Theorem [53j 

Definition 44 (Generic property) A property in a topological space X is 
said to be generic if it holds on a dense, open subset of X. 

Definition 45 (cr-algebra) A collection A = {U} of subsets of some set X is 
a (j-algebra if A has the following properties: 

• X eA. 

• If U G A, then its complement is also an element: G A. 

• If Ui G A for a family i = 1 . . . oo, then the union is also an element: 
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Definition 46 (Measure) A measure on a a-algebra A is a function jii: A ^ 
M>o satisfying the a-additivity property: if {/7^}i=i..oo is a countable family of 
pairwise disjoint subsets in A, then /i (Ui^i ^i) = Xli^i /^(^i)- 

Definition 47 (Almost everywhere) A property is said to hold almost ev- 
erywhere with respect to a probability measure on a space X if it holds on 
a subset U C X such that /i{X \U) = 0. This corresponds to holding with 
probability one, or almost surely. 

Assume given a metric space (X, d), and a- algebra A which contains the 
topology induced by (i, and measure /i on the sigma- algebra A. Assume also 
given an equivalence class ~ on X, and denote by X = X/ ^ the quotient of X 
with respect to the equivalence. Denote by tt : X ^ X the projection onto the 
quotient. 

Lemma 48 The set A = {U C X|7r-^(f7) G A} is a a -algebra on X. 
Proof. • TT-^X = X eAsoX eA. 

• UtJ eA then TT-W eAso ir-^U^) = {tt-W)^ G A, so G A. 

• IftJi e AJoT i = 1 . . . oo then TT-^t/^ G A for aU i, so tt'^ (U,^i Ui) = 

[JT=i^~'UieAso[JT=iUieA. □ 

Lemma 49 The function ft: A ^ ]R>o given by fi{U) = fi{7r~^{U)) is a mea- 
sure on A. 

Proof. If {Ui}'^i is a countable family of pairwise disjoint subsets in A, then 
{7r~^Ui}'^i is ^ countable family of pairwise disjoint subsets in A, and fl (Ui^i ^i) 

M {^-' (u=i m = (u=i i^-'m = E,=i i^-'Ui) = EZi A m. □ 

Proposition 50 There are Lebesgue induced measures fhx and fhx on the 
tree-spaces X and X , coming from the Lebesgue measure mx on X. For these 
measures, subsets of positive codimension have measure zero. In particular, 
generic properties hold almost everywhere with respect to the Lebesgue induced 
measure. □ 

Recall that we also work on a restricted subspace Z C X which consists of 
trees of certain fixed topologies that are collapsed versions of the maximal tree 
3^. The subspace Z is a finite union 

n 
i=l 

where each Zi is a (i- dimensional linear subspace of X corresponding to a certain 
fixed collapsed version of £^ described by its nonzero edge set Ei C E. We 
assume, moreover, that Zi ^ Zj whenever i ^ j. Each Zi now has a Lebesgue 
measure mi of its own, and we obtain a Lebesgue induced measure mz on Z 
defined as 

n 

mz{U) = Y,^i{Uf^Zi). 

i=l 

The measure mz gives rise to Lebesgue induced measures on Z and Z as ex- 
plained above. 
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Proposition 51 There are Lebesgue induced measures fhz and fhz on the tree- 
spaces Z and Z, coming from the Lebesgue induced measure mz on Z. If all 
Zi have equal dimension, subsets of positive codimension have measure zero. In 
particular, generic properties hold almost everywhere with respect to the Lebesgue 
induced measure. □ 

Definition 52 ^29| p. 120] Let /i and A be two measures on a a-algebra A. 
The measure A is absolutely continuous with respect to if and only if X{U) = 
for each U eU with /i(/7) = 0. 

Theorem 53 Let X be a probability measure which is absolutely continuous with 
respect to the Lebesgue induced measure fa on one of the tree-space X or X, 
or of Z or Z under the assumption that all Zi have equal dimension. Now any 
subset of positive codimension has zero measure with respect to X. In particular, 
generic properties hold almost surely, or with probability 1, with respect to X. \3 

Remark 54 Note that the converse is not necessarily true: Properties which 
hold almost surely are not necessarily generic. 



B Implementation of approximate QED geodesic 
computation 

In this appendix, we extend the explanation of how the approximate QED 
distance and geodesic computation specified in section 4.2 in the main paper 
can be made in practice. We shall consider computation of distances in the 
space of ordered trees X, as the complete search reduction from ordered trees 
to the unordered tree-space X is not hard to implement. 

Recall that the tree-space X is constructed from the Euclidean space 

X= fJ(M'^)^, 

eeE 

consisting of all trees spanned by a certain maximal combinatorial tree ^ = 
(V, ^,r). The construction is made by identifying different representations of 
the same tree, with the help of an equivalence relation ~. Denote by x the 
equivalence class of a point x e X. As discussed in the article, there is a 
standard way of defining a distance function on X = X/ ~, given by the quotient 
pseudometric 

d{x,y) = inf |^ ex.yi^ ^i+i,^/c ^ y| , (55) 

which is shown to be an actual metric. 

In the paper, we suggest approximating the geodesic by bounding the number 
of visited equivalence classes: 

inf/c<K ^Yli=id{xi,yi)\xi e x,yi ^ ^i+i,^/c ^ y| , 

and in practice, in our experiments, we have used an approximation with K 
either 2 or 3. 
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B.l Reformulation of tree-space identifications in terms 
of tree isomorphism 

The quotient space construction is extremely useful for understanding the ge- 
ometry of tree-space and proving statistical properties of tree-space based on 
geometry. However, it is less intuitive how one should proceed in order to actu- 
ally compute distances in tree-space. In this section, we shall explain how the 
tree-space gluings are related to the trees themselves. 

Recall that two points in X are equivalent under ~ and glued together in 
the formation of X if they represent the same ordered tree-shape. How can 
we determine whether two points in tree-space represent the same ordered tree- 
shape? This reduces to a tree isomorphism problem. More precisely: 

i) Assume given two points Xi,X2 G X. 

ii) These points correspond to two combinatorial trees ^1,^2, equal to or de- 
rived from where some of the edges in ti may be endowed with zero 
attributes in Xi. 

iii) Create two new combinatorial trees t^^, ^2 by deleting the edges in ti and ^2 
with zero attributes in xi and X2, and using the induced orders on siblings 
from ti, t2 to induce total orders on siblings in t'2. 

iv) The question of whether xi and X2 end up in the same identified subspace 
is now equivalent to the question of whether t'^ and t'2 are isomorphic as 
ordered, combinatorial trees. The question of whether xi and X2 represent 
the same tree-shape, is equivalent to the question of whether a) t'-^ and t'2 
are topologically isomorphic, and b) if yes, whether their ordered sets of 
attributes are identical. 

v) The observations above give us a way to judge whether two points in X are 
equivalent under the equivalence relation on X; equivalently, whether 
they are identified in X. This realization can be used to reformulate 
Algorithm 1 from the main paper in terms of the trees rather than in 
terms of the tree-space. 

vi) A geodesic in X is composed of a set of linear stretches in X between 
equivalence classes that belong to lower-dimensional identified subspaces. 
Each such stretch can change one endpoint tree to the other endpoint tree 
by i) gradually shrinking and removing edges, ii) adding and gradually 
extending edges, or iii) gradually changing edges which are "present" in 
both trees. All these changes are made simultaneously at constant speed 
throughout the linear stretch. 

vii) Given two points xi and X2 in X, how can we find the shortest path 
joining them which consists of two Euclidean stretches? The length of the 
shortest such path is (iapprox, 2(^1, ^2)- We shall look at this problem in 
section IB. 21 below. 

B.2 Paths with two EucUdean stretches 

In this section we study the problem of finding dapprox,2' 
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y 




w 



delete internal collapsing edges from xi, 
that is, belonging to E]^^. 
(interpolate to zero attribute) 



grow out internal collapsing edges from X2, 
that is, belonging to £^2"'*. 
(interpolate from zero attribute) 



deform all edges appearing in w 



Figure 15: Path consisting of two Euclidean stretches. 

• Let xi, X2 be tree-shapes in X. Let y be some tree-shape whose topological 
structure can be obtained as a collapsed version of both xi and X2. 

• The topology of the tree y is obtained from xi (and similarly from X2) by 
collapsing two types of edges: 

— internal edges (that is, there is some edge in the tree which is further 
from the root in xi (or ^2), which is not being collapsed), or 

— external edges (that is, all the edges in xi (or X2) which are farther 
from the root are also collapsed). 

• Denote the sets of such edges as E]^^ , Ef^^ , respectively. 

• We are looking for a point w ^ X which contains some such ^ as a subtree, 
such that the shortest two-stretch path from xi to ^2 passes through 
and consists of the following types of deformations: 

— deformation of all edges that collapse to an edge in y] throughout the 
whole path. 

— gradually deforming all external collapsing edges in so that when 
we reach ^2, their attributes have become zero; throughout the whole 
path. 

— gradually growing all external collapsing edges in X2 from zero at- 
tributes when the deformation starts in xi\ throughout the whole 
path 

— gradually deforming all internal collapsing edges in xi so that when 
we reach their attributes are zero; throughout the first stretch. 

— from gradually grow all internal collapsing edges in X2] throughout 
the second stretch. 

— Thus, w contains the edges from z as well as the edges from Ef^^ and 
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Now the distance from xi to X2 via w is: 

dist{xi^X2,w) = 




■ 2. Il(5^2)e|| = 

eeEl 



int 



length of the first stretch from xi to w 
lY^J{X2)e-ZeP+ ^ ||(X2)e|| = 



length of the first stretch from w to x^ 

where \\{xi)e — ^e|| = ||^e|| in case e is not an edge in x^, i = 1,2. 



• This is ihustrated in figure [15 

• Now we can define d approx, 2'- 

^approx, I 

= min ((iapprox, i(^i, ^2), min^ dist{xi, X2,w)) , 

where w loops through the set of ah trees generated from some tree y 
which is a cohapsed version of both xi and X2. 

Remark 57 (Paths with one Euclidean stretch) The simplest possible path 
between two tree-shapes does not visit any identified subspaces, and has length 
dapprox,!- We havc not in an obvious way taken into account that this might in 
fact be the shortest path. In order to see that this is part of the computation, 
let us start out as above: 

• Let xi, X2 be tree-shapes in X. Let y be some tree-shape whose topological 
structure can be obtained as a collapsed version of both xi and X2. 

• The topology of the tree y is obtained from xi (and similarly from X2) by 
collapsing two types of edges: 

— internal edges (that is, there is some edge in the tree which is further 
from the root in xi (or ^2), which is not being collapsed), or 

— external edges (that is, all the edges in xi (or X2) which are farther 
from the root are also collapsed). 

• Assume that it is possible to find some tree y which is obtained by only 
collapsing external edges. 

• The shortest path passing through the corresponding w will be a single 
Euclidean stretch, which can be obtained as described above. 

• Hence, the option that dapprox,i might give the optimal path is part of the 
computation of dapprox,2' 

Remark 58 There is a chance that the intermediate trees appearing in this 



algorithm are "too large" for X (for example as in fig. 15 We have chosen to 
disregard this, and cut the intermediate trees off in iterative algorithms if they 
are too large. This can also be controlled by cutting iv off by default. 
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We still have not addressed the question of where the collapsed tree y comes 
from. 

• In order to compute d appr ox, 2{^i-, ^2)1 we need to search through all dif- 
ferent collapsed versions y of xi and ^2. 

• In order to reduce computation time, we approximate this by setting a 
bound D on the degree of vertices of the collapsed versions. 

• For D = 3, the number of such subtrees is 0(n), where n is the number of 
edges in the tree. For I) > 3, the number of such subtrees is 0(n*^^~^^). 
In our experiments we use D = ?>. 



B.3 Paths with several Euchdean stretches 



Note that if we can answer the question in vii) in the beginning of section B.l 
then we can compute dapprox.K as well: 

dapprox,K{^l^ — 
jj^lYi } _ d appr ox, K-l{^_l 1^2)1 

\ min^ {d 

approx, I ^approx, K—1 

where w goes through a family of trees derived from all possible collapsed trees 
y obtained from xi. 

Remark 59 This approach can be adapted to computation in a more restricted 
space Z by making sure that all the intermediate trees are contained in Z. 
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